Evaluation of Automatic Legal Text Summarization Techniques for Greek Case Law
نویسندگان
چکیده
The increasing amount of legal information available online is overwhelming for both citizens and professionals, making it difficult time-consuming to find relevant keep up with the latest developments. Automatic text summarization techniques can be highly beneficial as they save time, reduce costs, lessen cognitive load professionals. However, applying these documents poses several challenges due complexity lack needed resources, especially in linguistically under-resourced languages, such Greek language. In this paper, we address automatic documents. A major challenge area suitable datasets response, developed a new metadata-rich dataset consisting selected judgments from Supreme Civil Criminal Court Greece, alongside their reference summaries category tags, tailored purpose automated document summarization. We also adopted state-of-the-art methods abstractive extractive conducted comprehensive evaluation using human metrics. Our results: (i) revealed that, while exhibit average performance, generate moderately fluent coherent text, but tend receive low scores relevance consistency metrics; (ii) indicated need metrics that capture better summary’s coherence, relevance, consistency; (iii) demonstrated fine-tuning BERT models on specific upstream task significantly improve model’s performance.
منابع مشابه
Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization
Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...
متن کاملCombining Different Summarization Techniques for Legal Text
Summarization, like other natural language processing tasks, is tackled with a range of different techniques particularly machine learning approaches, where human intuition goes into attribute selection and the choice and tuning of the learning algorithm. Such techniques tend to apply differently in different contexts, so in this paper we describe a hybrid approach in which a number of differen...
متن کاملA survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملKeyphrase Based Evaluation of Automatic Text Summarization
The development of methods to deal with the informative contents of the text units in the matching process is a major challenge in automatic summary evaluation systems that use fixed n-gram matching. The limitation causes inaccurate matching between units in a peer and reference summaries. The present study introduces a new Keyphrase based Summary Evaluator (KpEval) for evaluating automatic sum...
متن کاملRepresenting discourse for automatic text summarization via shallow NLP techniques
In this thesis I have addressed the problem of text summarization from a linguistic perspective. After reviewing some work in the area, I have found that many satisfactory approaches to text summarization rely on general properties of language that are reflected in the surface realization of texts. The main claim of this thesis is that some general properties of the discursive organization of t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information
سال: 2023
ISSN: ['2078-2489']
DOI: https://doi.org/10.3390/info14040250